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(54) File portability techniques 

(57) Systems and methods for building a platform 
specific compiler in a multi-platform environment are 
provided. A set of user defined platform dependent com- 
piler architecture descriptors that describe correspond- 
ing architectural features of a particular hardware plat- 
form dependent compiler are provided. The descriptors 
are converted into the platform dependent compiler 



source code which is compiled into platform dependent 
compiler object code. The platform specific compiler is 
formed from the platform dependent compiler object 
code and platform independent compiler object code al- 
ready provide. During compiler run time an interface me- 
diates the flow of information between the platform de- 
pendent compiler object code and the platform inde- 
pendent compiler object code. 
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Description 

BACKGROUND OF THE INVENTION 

1 . Field of Invention 

[0001] The invention relates generally to computer 
systems. More particularly, methods and apparatus for 
porting software between different computing platforms 
are disclosed. 

2. Description of Relevant Art 

[0002] The continuing proliferation of software plat- 
forms and hardware architectures ensures that both 
computer users and computer program developers will 
encounter many different computing environments in 
the course of their careers. It should be noted that in the 
context of this discussion, the term environment refers 
to the complete range of elements in a computing sys- 
tem that interact with the ported software. These ele- 
ments typically include a processor and operating sys- 
tem as well as I/O devices, libraries, networks, or, in 
some cases, a larger human or physical system. Even 
though a fow quasi -standard platforms (e.g. IBM-PC, 
UNI X) have become widely used there, as yet, is no uni- 
versal computing environment. In order to maintain and 
expand their viability, therefore, most software pro- 
grams will eventually face the need to be ported, such 
that an executable version of the software program 
based on the existing version is created in the new com- 
puting environment. Portability, or the ability of a soft- 
ware program to be ported to a given environment (i.e., 
the target) is, therefore, becoming universally recog- 
nized as a desirable attribute for most software pro- 
grams. It is clear, therefore, that portability between dif- 
ferent computing platforms enhances the value of a soft- 
ware program both by extending its useful lifecycle and 
by expanding the range of installations in which it can 
be readily used. As is well known in the art, a software 
program can include an application program, a system 
program, or a component of a program whereas a soft- 
ware system is a collection of software programs. 
[0003] Porting of a software program is useful to the 
degree that the cost of porting is less than the cost of 
rewriting the program in the new target environment. A 
software program would be perfectly portable if it could 
be ported al zero cost and, of course, this is never pos- 
sible in practice. In practice there are two basic porta- 
bility protocols, the first being binary portability (i.e., 
porting the executable form of the software program) 
and the second being source portability (i.e. , porting the 
source language representation of the software pro- 
gram). Although binary portability protocols typically of- 
fer several advantages (related primarily to ease of port- 
ing) it can only be used to port software programs across 
strongly similar environments thereby severely limiting 
its usefulness. In contrast, since source portability pro- 



tocols assume availability of a source code, they typi- 
cally provide a greater ability to adapt a particular soft- 
ware program to a wider range of computing environ- 
ments. 

5 [0004] The porting process in general has two princi- 
pal components referred to as transportation and adap- 
tation. In the context of the porting process, the trans- 
portation component is defined primarily as the physical 
movement of the software program between the various 
10 different computing platforms. Although this may seem 
to be a trivial task, in many cases it is not trivial since 
compatible media must be used and various types of 
representation conversions may also be required. On 
the other hand, the adaptation component is any modi- 
15 fication that must be performed on the original version 
that includes both mechanical translation such as by 
language processors, and manual modification by hu- 
mans both of which incur costs in terms of capital equip- 
ment and man hours in the form of increased develop- 
20 menl costs. 

[0005] In addition to the increase in development 
costs, there remains the possibility of a reduction in 
some quality measures of the actual software such as 
performance, or conformance to system -specific user- 
25 interface conventions. However, the corresponding 
benefits typically take the form of reduced costs to pro- 
duce and maintain future implementations of the partic- 
ular software program, as well as possible quality im- 
provements in factors such as reliability. 
30 [0006] Unfortunately, most of the porting process is 
still done by ad hoc methods that result in inefficient 
techniques that add substantially to the costs of porting 
software from one platform to another. By way of exam- 
ple, a compiler translates a computer program from one 
35 language into another, catching any errors in syntax 
along the way. Most commonly, a compiler translates 
some high level language, such as C++ or COBOL, into 
optimized machine language such that a computer can 
understand without any translation. In order to fully port 
40 a compiler, therefore, several tasks must be accom- 
plished in order for the ported compiler to be able to suc- 
cessfully, and in a highly reliable manner, perform its de- 
signed functions while operating in a totally different 
platform than the one in which it was originally con- 
45 ceived. 

[0007] The several tasks required to be accomplished 
in order to fully port a compiler include customization of 
instruction selection, register albcalion, instruction 
scheduling, instruction peephole optimization, calling 
so conventions, frame layout, runtime issues, instruction 
encoding, and "clues" for optimization for the different 
platform. Current compiler porting techniques have only 
been able to efficiently perform the first four tasks leav- 
ing the remaining five tasks for humans to accomplish 
manually. Typically, in order to accomplish the remaining 
five tasks requires that anywhere from approximately 
30,000 lines of code to as much as 100,000 lines of code 
must be written by hand which can take up to a full man 
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year to accomplish. This is obviously very inefficient, not 
cost effective, and leaves the entire porting process 
prone to error, both human and otherwise. 
[0008] Therefore, what is desired is the capability of 
porting software programs, including compilers, from 
one platform to another different platform in a cost ef- 
fective and resource efficient manner. 

SUMMARY OF THE INVENTION 

[0009] Broadly speaking, the invention relates to an 
improved method, apparatus and computer system for 
efficiently porting a software program, including in one 
implementation a compiler, from one computing plat- 
form to another different computing platform. The inven- 
tion can be implemented in numerous ways, including 
as a method, a computer system, an apparatus, and a 
computer readable medium. Several embodiments of 
the invention are discussed below. 
[0010] According lo one aspect of the present inven- 
tion, an apparatus for compiling a platform specific com- 
piler is described. The apparatus includes a set of user 
defined platform dependent compiler architecture de- 
scriptors that describe corresponding architectural fea- 
tures of a particular hardware platform. An architecture 
descriptor compiler converts the user defined platform 
dependent compiler architecture descriptors into the 
platform dependent compiler source code which is con- 
verted into platform dependent object code by a host 
compiler. During run-time for the platform specific com- 
piler, an interface mediates the flow of information be- 
tween platform dependent compiler object code and 
platform independent compiler object code. 
[0011] As a method for building a platform specific 
compiler, a set of user defined platform dependent com- 
piler architecture descriptors that describe correspond- 
ing architectural features of a particular hardware plat- 
form dependent compiler are provided. The descriptors 
are converted into platform dependent compiler source 
code by an architecture descriptor compiler. The plat- 
form dependent compiler source code is compiled into 
platform dependent object code. The platform specific 
compiler is formed form the platform dependent object 
code and platform independent compiler object code al- 
ready provided. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] The invention, together with further advantag- 
es thereof, may best be understood by reference to the 
following description taken in conjunction with the ac- 
companying drawings in which: 

Fig. 1 is a representative block diagram of a multi- 
platform compiler system in accordance with an em- 
bodiment of the invention; 

Fig. 2 illustrates a particular implementation of the 
multi-platform compiler shown in Fig 1 ; 



4 

Fig. 3 shows another implementation of the compil- 
er shown in Fig. 2; 

Fig. 4 shows a flowchart detailing a compiler build- 
ing process in accordance with an embodiment of 
5 the invention; 

Fig. 5 illustrates a Java Virtual Machine (JVM) hav- 
ing a platform specific compiler in accordance with 
an embodiment of the invention; 
Fig. 6A illustrates an exemplary AD file organization 
in accordance with an embodiment of the invention; 
Fig. 6B illustrates a particular relationship between 
various data fields included in the AD file shown in 
Fig. 6A; 

Fig.7 illustrates an exemplary interface coupling 
platform dependent source code and platform inde- 
pendent source code in accordance with an embod- 
iment of the invention; 

Fig. 8 is an exemplary representation of a platform 
specific run time process in accordance with an em- 
bodimenl of the invention; and 
Fig. 9 illustrates a computer system employed to im- 
plement the invention. 

[0013] In another embodiment, a platform specific 
compiler is disclosed. The compiler includes platform 
dependent compiler object code and platform independ- 
ent compiler object code which are suitable for execu- 
tion on a particular hardware platform. An interface that 
is partially embedded in the platform independent object 
code and partially embedded in the platform dependent 
object code mediates flow of information between the 
platform independent compiler code and the platform 
dependent compiler object code during platform specific 
compiler run time. 

[001 4] These and other advantages of the present in- 
vention will become apparent upon reading the following 
detailed descriptions and studying the various figures of 
the drawings. 

DETAILED DESCRIPTION OF THE EMBODIMENTS 

[0015] In the following description, frameworks and 
methods of porting a software program designed to run 
on an original computing platform to any numberof other 
different computing platforms are described. The inven- 
tion will initially be described in terms of a compiler re- 
siding in a Java virtual machine. In general, in order to 
port a compiler that was designed to run on a first oper- 
ating system to a second, different operating system (i. 
e., also referred to as the target platform), a user defined 
platform specific architecture descriptors in the form of 
an architecture description (AD) file is provided. In the 
described embodiment, the AD file includes all target 
platform architectural information required to describe 
the platform specific compiler. The AD file forms an input 
to a multi-platform compiler that includes an architecture 
design language compiler (ADLC). In one embodiment, 
the ADLC is used to generate particular target platform 
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dependent source code used to build the platform spe- 
cific compiler. The platform specific compiler is built us- 
ing platform dependent source code provided by the 
ADLC and target independent source code otherwise 
provided. . 

[0016] In this way, the amount of manual coding re- 
quired to define the platform specific compiler is sub- 
stantially reduced. The multi-platform compiler system 
in conjunction with the architecture design file particular 
to a specific target platform substantially reduces the 
time required to port a compiler from one platform to an- 
other different platform by substantially reducing the re- 
quired use of manual code generation. 
[0017] Fig. 1 is a representative compile-time block 
diagram of a multi-platform compiler system 100 in ac- 
cordance with an embodiment of the invention. The mul- 
ti-platform compiler system 100 is capable of building, 
or compiling, a platform specific compiler capable in it- 
self of compiling source code. Such source code can, 
and does include, for example, Java source code. In the 
described embodiment, the platform specific compiler is 
built by compiling both platform independent source 
code representing those portions of the platform specific 
compiler that are independent of any particular platform 
and platform dependent source code representing 
those features of the compiler that are platform specific. 
Typically, the only user supplied coding is that coding 
used to provide a properly structured AD file. 
[0018] More particularly, in the described embodi- 
ment, the multi-platform compiler system 100 includes 
• a multi-platform compiler builder 102 arranged to build 
the platform dependent compiler using platform specific 
architecture descriptors. In the described embodiment, 
the platform specific architecture descriptors take the 
form of an architecture description (AD) written in an ar- 
chitecture description language (ADL) used to represent 
those platform dependent portions of the target platform 
compiler. It should be noted that the ADL can take many 
forms well known to those skilled in the art such as C++, 
Attribute Grammars, Custom Description Languages, 
etc., or some combination of these forms. Typically, it is 
the AD file or files that are coded by the user of the multi- 
platform compiler system 100. however, in some cases, 
the AD tiles are provided by original equipment manu- 
facturers in situations referred to as "turn key" systems. 
In these situations, the end user has selected which tar- 
get platforms are deemed to be useful and the supplier 
has provided the necessary coding efforts. 
[0019] In many instances, the AD files are located in, 
for example, memory systems external to the multi-plat- 
form compiler builder 102 such as an AD file 104 and 
an AD file 106. It should be that although any number 
of AD files can bo provided, each specific to a particular 
platform, albeit, only one AD file at a time is processed 
by the multi-platform compiler builder 102. In this way, 
when source code specific to, for example, a platform 
type 1 is provided, the compiler builder 102 is capable 
of compiling source code into platform type 1 object 



code. For example, the AD file 104 can include ADL 
code used by the compiler builder 102 to build an X86 
compiler used to compile source code into X86 object 
code. As well known in the art. X86 object code are 
5 those instructions executable by an X86 microprocessor 
manufactured by the Intel Corporation of Santa Clara, 
CA. . 

[0020] Along the same lines, the AD file 106 can in- 
clude ADL code used by the compiler builder 102 to 
10 build, for example, a SPARC compiler used to compile 
source code into SPARC object code executable by a 
SPARC microprocessor. As well known in the art, 
SPARC object code are those instructions executable 
by an SPARC microprocessor used in computers man- 
15 ufactured by Sun Microsystems of Mountain View. CA. 
in this way, the multi-platform compiler builder 102 is ca- 
pable of automatically providing as many of the platform 
specific compilers as there are AD files available. In this 
way, the manual coding required in order to generate, 
20 jn ihese examples, an X86 compiler and a SPARC com- 
piler is greatly reduced since it is essentially limited to 
the effort used to write the code included in AD files 104 
and 106, respectively. 

[0021] Referring now to Fig. 2 illustrating a particular 
25 implementation of the multi-platform compiler builder 
102 shown in Fig 1 . In the embodiment shown, the com- 
piler builder 102 includes an architecture description 
language compiler 202 (ADLC) coupled to an AD file 
204. The ADLC 202 is arranged to compile the platform 
30 specific ADL code included in the AD file 204 into plat- 
form dependent compiler source code. The platform de- 
pendent compiler source code is, in turn, provided as 
input to a host compiler 203 coupled to the ADLC 202. 
In the described embodiment, the host compiler 203 is 
35 a C++ compiler well known to those skilled in the art. 
However, any compiler suitably arranged to compile 
source code to target specific compiler object code can 
be used. The host compiler 203 compiles the platform 
dependent compiler source code provided by the ADLC 
40 202 into platform dependent compiler object code which 
is stored, or otherwise made accessible, to platform de- 
pendent object code block 206 included in a compiler 
unit 208. The compiler unit 208 also includes platform 
independent object code representative of the platform 
45 independent features common to most compilers. In the 
described embodiment, the platform independent object 
code is stored in a platform independent object code 
block 210. In one embodiment, the platform independ- 
ent object code block 210 is a compilation engine 212 
50 that communicates with a platform independent inter- 
face 21 4. The compilation engine 21 2 can be embedded 
within the platform independent compiler object code. 
In some cases, the platform independent compiler ob- 
ject code is derived from platform independent compiler 
55 source code compiled by the host compiler 203. In other 
cases, such as the described embodiment, the platform 
independent compiler object code is already provided. 
[0022] During run time (execution), the platform inde- 
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pendent compiler object code, interacts with the plat- 
form dependent compiler object code to operate (i.e., 
compile) in a target dependent manner. In a particular 
implementation during runtime, the interface 214 medi- 
ates the flow of information between selected portions 
of the platform dependent object code and the compila- 
tion engine 212. 

[0023] By way of example, when an X86 microproc- 
essor is required, the user provides, for this example, 
the AD file 204 having stored therein the appropriate 
ADL code specific to the X86 platform. The AD file 204 
then supplies this X86 specific ADL code to the ADLC 

202 which converts it to X86 specific compiler source 
code. It should be noted that the ADLC 202 is a universal 
compiler capable of compiling any properly constructed 
AD tile in appropriate ADL code into corresponding plat- 
form specific compiler source code thereby eliminating 
any requirements that the user code, or in any way mod- 
ify the ADLC 202 or any portion thereof. Therefore, the 
only coding required of the user is thai required to pro- 
vide a properly constructed and verified AD file. 
[0024] Once the ADLC 202 has converted the X86 
specific ADL code to X86 compiler source code, the X86 
compiler source code is compiled by the host compiler 

203 to form X86 compiler object code that is associated 
with the code block 206. During run-time, when the com- 
pilation engine 212 determines that information from a 
specific portion of the platform dependent object code 
is required, the compilation engine 212 issues an infor- 
mation request. In a particular embodiment of the inven- 
tion, an example of this request is referred to as an emit 
call that is processed by the interlace 21 4. The interface 
214 manages all information transfers between the plat- 
form dependent object code in the block 206 and the 
compilation engine 212 in response to a particular re- 
quest, such as an emit call. 

[0025] In some cases, multiple platform dependent 
compiler source code files can be compiled into a com- 
piler unit 300 shown in Fig. 3. The compiler unit 300 is 
one implementation of the compiler unit 208 shown in 
Fig. 2 and, as such, should only be consider exemplary 
in nature. I n the described embodiment, the various plat- 
form dependent AD files are stored in a file stack 301. 
The tile stack 301 can be local to the multi-platform com- 
piler builder 102 or, in some cases, can be remotely lo- 
cated in, for example, databases, remote servers, etc. 
This arrangement is particularly well suited for applica- 
tions involving transferring data over coupled computer 
networks, such as the Internet, local area networks 
(LANs), and the like. This use of multiple AD files is par- 
ticularly advantageous since it provides the compiler 
unit 300 with the capability of operating, as needed, as 
any platform specific compiler represented by the cor- 
responding AD file. By way of example, the stack file 
301 includes platform specific compiler code files 302 
and 304 representing, for example, SPARC and X86 op- 
erating platforms. A selector unit (not shown) selects, 
for example, the AD file 302 which is ultimately compiled 



by the host compiler into SPARC object code and stored 
in a block 306. With this arrangement, any platform hav- 
ing its corresponding AD file included in the file stack 
301 can be selected to customize the compiler unit 300. 

5 [0026] The multi-platform compiler builder 102 builds 
a particular compiler, using in one embodiment, a proc- 
ess 400 detailed by the flowchart shown in Fig. 4. The 
process 400 begins at 402 by providing platform specific 
architecture descriptors in the form of ADL code stored 

10 in, for example, an AD file. The ADL code is then passed 
to an ADLC at 404 which converts the ADL code to plat- 
form specific compiler source code. As discussed 
above, platform independent compiler source code, al- 
ready provided, is stored in a platform independent corn- 
's piler source code file at 406. In some embodiments, plat- 
form independent compiler object code can be provided. 
The host compiler then builds the platform dependent 
compiler by compiling the platform dependent compiler 
source code concurrently and the platform independent 

20 compiler source code at 408. Once the platform specific 
compiler has been built at 410, the platform specific 
compiler is available for compiling source code into plat- 
form specific object code as needed. In a particular em- 
bodiment, during run-time, a platform independent in- 

25 tcrfacc partially embedded in both the platform depend- 
ent compiler object code and the platform independent 
compiler object code mediates the flow of information 
between the compilation engine and the platform de- 
pendent compiler object code. 

30 [0027] More recently, the Java programming lan- 
guage, an object-oriented language, has introduced the 
possibility of compiling output (called bytecode) that can 
run on any computer system platform for which a Java 
virtual machine (or bytecode interpreter) is provided. 

35 The Java virtual machine is designed to convert the . 
bytecode into instructions that can be executed by the 
actual hardware processor. Using this virtual machine, 
rather than being interpreted one instruction at a time, 
bytecodes can be recompiled at each particular system 

4p platform by, in some cases, a just-in-time (JIT) compiler. 
[0028] Fig. 5 illustrates an apparatus that includes a 
Java Virtual Machine (JVM) 500 incorporating the com- 
piler unit 208 in accordance with an embodiment of the 
invention. I n the described arrangement, a platform spe- 

45 cific AD file stack 502 coupled to the ADLC 202 includes 
a group of AD files each representing particular platform 
dependent compiler features. In the described embodi- 
ment, the AD file stack 502 includes the AD file 1 04 and 
AD file 106 representing the X86 and SPARC architec- 

50 tures, respectively. In the case where a number of dif- 
ferent AD files are included in the AD file stack 502, a 
selector unit (not shown) is typically used to select a par- 
ticular AD file from the AD file stack 502 corresponding 
to the desired operating platform. When the appropriate 

55 AD file is selected, the ADLC 202 converts the ADL code 
included in the selected AD file into appropriate platform 
dependent compiler source code as discussed above. 
[0029] In the Java programming language and envi- 
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ronment, a just-in-time (JIT) compiler is a program that 
turns Java bytecode into instructions thai can be sent 
directly to the processor. After a Java program has been 
written, the Java source language statements are com- 
piled by the Java compiler into Java bytecode rather 
than into code that contains instructions that match a 
particular hardware platform's processor {for example, 
an Intel Pentium microprocessor or an IBM System/390 
processor). The Java bytecode is platform-independent 
code that can be sent to any platform and run on that 
platform. 

[0030] More particularly, when bytecodes are provid- 
ed to a JIT compiler provided by the compiler 208, the 
compilation of methods contained in bytecodes 504 is 
delayed until the methods are about to be executed. 
When bytecodes 504 are provided to an interpreter 506, 
bytecodes 504 are read into interpreter 506 one byte- 
code at a time. Interpreter 506 then performs the oper- 
ation defined by each bytecode as each bytecode is 
read into interpreter 506. That is : interpreter 506 "inter- 
prets" bytecodes 504,. as will be appreciated by those 
skilled in the art. In general, interpreter 506 processes 
bytecodes 504 and performs operations associated with 
bytecodes 504 substantially continuously. 
[0031] When a method is interpreted, the sequence 
of bytecodes 504 are directly executed by interpreter 
506. If, on the other hand, the method which is invoked 
is a compiled method which has not been compiled, the 
platform specific compiler 208 is activated. The compiler 
208 then generates machine instructions from byte- 
codes 504, and the resulting machine-language instruc- 
tions may be executed directly by the target platform op- 
erating system 510. In general, the machine-language 
instructions are discarded when virtual machine 500 ter- 
minates. - 

[0032] Referring now to Figs. 6A and 6B, an organi- 
zation of an AD file 600 is shown in accordance with an 
embodiment of the invention. It should be noted that the 
organization shown is one of the many possible organ- 
izations that AD file 104 can take. In the described em- 
bodiment, the AD file 600 is a hierarchically organized 
set of distinct platform architecture descriptor data 
fields. By way of example, a register definition data field 
602 is used by the ADLC to describe individual registers 
and classes ot registers with the target architecture. An 
encoding block data field 604 specifies the encoding 
classes used by the target compiler to output byte 
streams. A frame management block data field 606 in- 
cludes information that defines the frame structure and 
management protocols. Such information defines, for 
example, what direction the frame stack grows, the 
number of stack slots consumed by a monitor enter op- 
eration, stack alignment requirements, number of stack 
slots reserved for "Top of Stack", amongst others. 
[0033] An operand data field 608 provides operand 
definitions that must precede instruction definitions for 
correct compilation in the ADLC since operands consti- 
tute user defined types which are used in instruction def- 



initions A pipeline rules data field 61 0 is provided to de- 
fine the behavior of the target architecture pipeline. An 
instruction definitions data field 61 2 provides instruction 
formats for the target architecture. A peephole data field 
5 614 provides target architecture specific optimization 
rules used by the ADLC. 

[0034] The hierarchical organization of the AD file 600 
underlies the interrelationship amongst the various ADL 
input data fields. Fig. 6B graphically illustrates one such 
10 relationship, specifically, the relationship between the 
various operands included in the instruction definitions 
data field 612. By performing a backwards traversal 
from using the instruction definitions data field 612 as 
* the root, the relational dependencies for the various op- 
is erands required to be input to the ADL input data field 
is determinable. For example, by performing an upward 
traversal starting from the instruction definition data field 
61 2 as the root and extending upward along the various 
branches, the pipeline rules data field 61 0 and operand 
20 definitions data field 608 are encountered. Performing 
an upward traversals from the from the pipeline rules 
data field 612 is the register definitions data field 602, 
while the register definitions data field 602 and the en- 
coding class data field 608 are encountered when an 
2S upwards traversal from the operand definitions data field 
608 is performed. In this way, the various operands re- 
quired to fully define a particular instruction definition is 
provided. 

[0035] Fig. 7 illustrates an exemplary interface 700 

30 used by the platform dependent compiler in accordance 
with an embodiment of the invention. The interface 700 
mediates communication between the compiler engine 
212 (or the platform independent object code) and the 
platform dependent object code in the block 206. In the 

35 embodiment shown, the ADLC output includes source 
code for a deterministic finite automaton (DFA) 702 that 
specifies the mapping from ideal operations to machine 
instructions. The ADLC output also includes .source 
code which defines a set of instruction classes 704 that 

40 are used to, for example, define legal register masks, 
encoding methods, branch offset behavior, etc. Source 
code for a peephole rules oracle706 that specifies ma- 
chine specific trees that are legal to optimize and what 
the correct replacement is for those trees is also output 

45 by the ADLC. In this way, the platform specific architec- 
ture characteristics are automatically provided in a for- 
mat suitable, after processing by the host compiler, for 
Ihe compilalion engine 212 lo use in compiling source 
code into platform dependent object code. 

so [0036] The target independent portion of the interface 
700 is coupled to a matcher 708 which generates input 
trees to be processed by the DFA 702 that performs bot- 
tom up rewrite rule system (BURS) style tree pattern 
matching in order to select machine instructions for ideal 
55 operations in the intermediate representation. 

[0037] The interface 700 also includes object code ar- 
range to act as a peephole DFA that performs tree pat- 
tern matching to find optimization candidates and re- 



BNSDOCID: <EP 10469B5A2J_> 



6 



11 



EP 1 046 985 A2 



12 



places matched trees of machine instructions with opti- 
mized trees of machine instructions. A matcher 708 per- 
forms instruction selection using the matcher DFA and 
builds machine specific intermediate representations. A 
scheduler 710 orders the machine specific intermediate 
representations while a register allocator 712 selects a 
legal assignment of registers to operands in the ma- 
chine specific representation. This includes the insertion 
of any instructions necessary to relocate values to prop- 
er locations (such as moving arguments to their appro- 
priate location specified by the calling convention). A 
peephole optimizer 714 pattern matches small trees in 
the machine specific representation and replaces them 
with more optimal machine specific trees. An object 
code output 716 uses virtual calls to encode the ma- 
chine specific representation as machine object code in 
a buffer and makes the buffer available to the virtual ma- 
chine. 

[0038] Fig. 8 is an exemplary representation of a plat- 
form specific run time process in accordance with an 
embodiment of the invention. During the run time proc- 
ess, the compilation engine 212, when required, gener- 
ates a request (referred to, in a particular embodiment, 
as an emit function call) for specific information stored 
in the platform dependent compiler object code block 
206 which in a particular embodiment represents the ob- 
ject code of a particular instruction on a particular plat- 
form. This request is processed through the interface 
700, and results in the execution of platform dependent 
object code which computes the applicable encoding 
value, and makes that value available to the compilation 
engine 212 through the interface 700. The compilation 
engine 21 2 then uses the value in a platform independ- 
ent manner. 

[0039] By way of example, when the scheduler 710 
requires specific information from the instruction class 
706, an information requestor 802 sends an emit call to 
an information retrieverS04. In response to the emit call, 
the information retriever 804 retrieves the specific infor- 
mation which is then returned to the information reques- 
tor 802 by way of the interface 700. In this way, any in- 
formation required by the platform independent object 
code during run-time is provided using the interface 700 
which is itself platform independent. 
[0040] Fig. 9 illustrates a computer system 900 em- 
ployed to implement the invention. The computer sys- 
tem 900 or, more specifically, CPUs 902, may be ar- 
ranged to support a virtual machine, as will be appreci- 
ated by those skilled in the art. As is well known in the 
art, ROM acts to transfer data and instructions uni-di- 
rectionally to the CPUs 902, while RAM is used typically 
to transfer data and instructions in a bidirectional man- 
ner. CPUs 902 may generally include any number of 
processors. Both primary storage devices 904, 906 may 
include any suitable computer-readable media. A sec- 
ondary storage medium 908, which is typically a mass 
memory device, is also coupled bi-directionally to CPUs 
902 and provides additional data storage capacity The 



mass memory device 908 is a computer-readable me- 
dium that may be used to store programs including com- 
puter code, data, and the like. Typically, mass memory 
device 908 is a storage medium such as a hard disk or 

s a tape which generally slower than primary storage de- 
vices 904, 906. Mass memory storage device 908 may 
take the form of a magnetic or paper tape reader or 
some other well-known device. It will be appreciated that 
the information retained within the mass memory device 

10 908, may, in appropriate cases, be incorporated in 
standard fashion as part of RAM 906 as virtual memory. 
A specific primary storage device 904 such as a CD- 
ROM may also pass data uni-directionally to the CPUs 
902. 

is [0041] CPUs 902 are also coupled to one or more in- 
put/output devices 910 that may include, but are not lim- 
ited to, devices such as video monitors, track balls, 
mice, keyboards, microphones, touch-sensitive dis- 
plays, transducer card readers, magnetic or paper tape 

20 readers, tablets, styluses, voice or handwriting recog- 
nizers, or other well-known input devices such as, of 
course, other computers. Finally, CPUs 902 optionally 
may be coupled to a computer or telecommunications 
network, e.g., an Internet network or an intranet net- 

25 work, using a network connection as shown generally 
at 912. With such a network connection, it is contem- 
plated that the CPUs 902 might receive information from 
the network, or might output information to the network 
in the course of performing the above-described method 

30 steps. Such information, which is often represented as 
a sequence of instructions to be executed using CPUs 
902, may be received from and outputted to the network, 
for example, in the form of a computer data signal em- 
bodied in a carrier wave. The above-described devices 

35 and materials will be familiar to those of skill in the com- 
puter hardware and software arts. 
[0042] Although only a few embodiments of the 
present invention have been described, it should be un- 
derstood that the present invention may be embodied 

40 in many other specific forms without departing from the 
spirit or the scope of the present invention. By way of 
example, the multi-platform compiler can be used in any 
computing system. 

[0043] Although the methods of porting a compiler 
^5 from one operating system to another different operat- 
ing system in accordance with the present invention are 
particularly suitable for implementation with respect to 
a Java™ based environment, the methods may gener- 
ally be applied in any suitable object-based environ- 
so ment. In particular, the methods are suitable for use in 
platform-independent object-based environments. It 
should be appreciated that the methods may also be im- 
plemented in some distributed object-oriented systems. 
[0044] While the present invention has been de- 
55 scribed as being used with a distributed object based 
computer system, it should be appreciated that the 
present invention may generally be implemented on any 
suitable computing system having a compiler. There- 
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fore, the present examples are to be considered as il- 
lustrative and not restrictive, and the invention is not to 
be limited to the details given herein, but may be modi- 
fied within the scope of the appended claims along with 
their full scope of equivalents. 



Claims 

1. An apparatus for generating a platform specific 
compiler, comprising: 

a set of user defined platform dependent com- 
piler architecture descriptors that describe cor- 
responding architectural features of a particular 
hardware platform dependent compiler; 
an architecture descriptor compiler arranged to 
convert the user defined platform dependent 
compiler architecture descriptors into the plat- 
form dependent compiler source code; 
a host compiler arranged to compile the plat- 
form dependent compiler source code into plat- 
form dependent compiler object code; 
platform independent compiler object code; 
and 

an interface arranged to mediate the flow of in- 
formation between the platform dependent 
compiler object code and the platform inde- 
pendent compiler object code during run time 
for the platform specific compiler 

2. An apparatus as recited in claim 1 , wherein the plat- 
form specific compiler includes platform independ- 
ent compiler object code and platform dependent 
compiler object code suitable for execution of the 
particular hardware platform. 

3. An apparatus as recited in claim 2, wherein during 
platform specific compiler run time, the platform in- 
dependent compiler object code requests specific 
platform dependent object code information by pro- 
viding an information request to the interface which 
directs the information request to a pre-determined 
platform dependent compiler object code informa- 
tion retriever. 

4. An apparatus as recited in claim 3, wherein the plat- 
form dependent compiler object code information 
retriever responds to the information request by re- 
trieving specific platform dependent compiler object 
code information in satisfaction of the information 
request. 

5. An apparatus as recited in claim 4. wherein the re- 
trieved information is provided to the interface 
which, in turn, directs the information to the infor- 
mation requestor. 



6. An apparatus as recited in claim 1 , wherein said ap- 
paratus comprises a plurality of sets of user defined 
platform, dependent architecture descriptors, 
wherein each of which corresponds to a different 

5 hardware platform. 

7. A method of building aplatform specific compiler, 
comprising: 

10 providing a set of user defined platform de- 

pendent compiler architecture descriptors that 
describe corresponding architectural features 
of a particular hardware platform dependent 
compiler; 

15 converting the set of user defined platform de- 

pendent compiler architecture descriptors into 
platform dependent compiler source code by 
an architecture descriptor compiler; 
compiling the platform dependent compiler 

20 source code into platform dependent object 

code by a host compiler coupled to the archi- 
tecture descriptor compiler; 
providing platform independent compiler object 
code, wherein the platform independent com- 

25 pilor object code and the platform dependent 

compiler object code are suitable for execution 
on the particular hardware platform: and 
forming the platform specific compiler from the 
platform dependent compilerobject code and 

30 the platform independent compiler object code. 

8. A method as recited in claim 7, further comprising: 

requesting specific platform dependent object 
35 code information by the platform independent 

compilerobject code by the platform independ- 
ent object code during platform specific compil- 
er run time; 

providing an information request to the inter- 
40 face; and 

directing the information request to a pre-deter- 
mined platform dependent compiler object 
code information retriever by the interface. 

45 9. A method as recited in claim 8, further comprising: 
retrieving specific platform dependent compil- 
er object code information in satisfaction of the in- 
formation request in response to the request by the 
information retriever. 

so 

10. A method as recited in claim 9, further comprising: 
directing the retrieved information to the infor- 
mation requestor by the interface. 

ss 11. A method as recited in claim 10, further comprising: 
directing the retrieved information to the infor- 
mation requestor by the interface. 
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12. A platform specific compiler, comprising: 

a platform dependent compiler object code; 
a platform independent compiler object code, 
wherein the platform independent compiler ob- s 
ject code and the platform dependent compiler 
object code are suitable for execution on a par- 
ticular hardware platform; 
an interface partially embedded in the platform 
independent code and partially embedded in io 
the platform dependent object code, wherein 
during platform specific compiler run time, the 
interlace mediates flow of information between 
the platform independent compiler code and 
the platform dependent compiler code. 

13. A compiler as recited in claim 12, wherein during 
platform specific compiler run time, the platform in- 
dependent compiler object code requests specific 
platform dependent object code information by pro- 20 
viding an information request to the interface which 
directs the information request to a p re-determined 
platform dependent compiler object code informa- 
tion retriever. 

25 

14. An apparatus as recited in claim 13, wherein the 
platform dependent compiler object code informa- 
tion retriever responds to the information request by 
retrieving specific platform dependent compiler ob- 
ject code information in satisfaction of the informa- 30 
tion request. 

15. An apparatus as recited in claim 14, wherein the re- 
trieved information is provided to the interface 
which, in turn, directs the information to the infor- 35 
mation requestor. 

16. An apparatus as recited in claim 12, wherein said 
apparatus comprises a plurality of sets of user de- 
fined platform dependent architecture descriptors, *o 
wherein each ot which corresponds to a different 
hardware platform. 
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